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I. Introduction 


Or THE LAST one hundred years, 
the average real return to stocks in 
the United States has been about six per- 
cent per year higher than that on Trea- 
sury bills. At the same time, the average 
real return on Treasury bills has been 
about one percent per year. In this pa- 
per, I discuss and assess various theore- 
tical attempts to explain these two differ- 
ent empirical phenomena: the large 
“equity premium” and the low “risk free 
rate.” I show that while there are several 
plausible explanations for the low level 
of Treasury returns, the large equity pre- 
mium is still largely a mystery to econo- 
mists. 

In order to understand why the sample 
means of the equity premium and the 
risk free rate represent “puzzles,” it is 
useful to review the basics of modern as- 
set pricing theory. It is well known that 
the real returns paid by different 
financial securities may differ consider- 
ably, even when averaged over long 
periods of time. Financial economists 
typically explain these differences in av- 
erage returns by attributing them to dif- 
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ferences in the degree to which a secu- 
rity’s return covaries with the typical in- 
vestor’s consumption. If this covariance 
is high, selling off the security would 
greatly reduce the variance of the typical 
investor's consumption stream; in equi- 
librium, the investor must be deterred 
from reducing his risk in this fashion by 
the security’s paying a high average return. 

There is a crucial problem in making 
this qualitative explanation of cross-sec- 
tional differences in asset returns opera- 
tional: what exactly is “the typical inves- 
tor’s consumption”? The famous Capital 
Asset Pricing Model represents one an- 
swer to this question: it assumes that the 
typical investor’s consumption stream is 
perfectly correlated with the return to 
the stock market. This allows financial 
analysts to measure the risk of a financial 
security by its covariance with the return 
to the stock market (that is, a scaled ver- 
sion of its “beta”). More recently, 
Douglas Breeden (1979) and Robert Lu- 
cas (1978) described so-called “repre- 
sentative” agent models of asset returns 
in which per capita consumption is per- 
fectly correlated with the consumption 
stream of the typical investor. (See also 
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Mark Rubinstein 1976; and Breeden and 
Robert Litzenberger 1978.) In this type 
of model, a security's risk can be mea- 
sured using the covariance of its return 
with per capita consumption. 

The representative agent model of as- 
set pricing is not as widely used as the 
CAPM in “real world” applications (such 
as project evaluation by firms). However, 
from an academic economist’s perspec- 
tive, the “representative agent” model of 
asset pricing is more important than the 
CAPM. Representative agent models 
that imbed the Lucas-Breeden paradigm 
for explaining asset return differentials 
are an integral part of modern macro- 
economics and international economics. 
Thus, any empirical defects in the repre- 
sentative agent model of asset returns 
represent holes in our understanding of 
these important subfields. 

In their seminal (1985) paper, Rajnish 
Mehra and Edward Prescott describe a 
particular empirical problem for the rep- 
resentative agent paradigm. As men- 
tioned above, over the last century, the 
average annual real return to stocks has 
been about seven percent per year, while 
the average annual real return to Trea- 
sury bills has been only about one per- 
cent per year. Mehra and Prescott (1985) 
show that the difference in the covari- 
ances of these returns with consumption 
growth is only large enough to explain 
the difference in the average returns if 
the typical investor is implausibly averse 
to risk. This is the equity premium puz- 
zle: in a quantitative sense, stocks are 
not sufficiently riskier than Treasury 
bills to explain the spread in their re- 
turns. 

Philippe Weil (1989) shows that the 
same data presents a second anomaly. 
According to standard models of individ- 
ual preferences, when individuals want 
consumption to be smooth over states 
(they dislike risk), they also desire 


smoothness of consumption over time 


(they dislike growth). Given that the 
large equity premium implies that inves- 
tors are highly risk averse, the standard 
models of preferences would in turn im- 
ply that they do not like growth very 
much. Yet, although Treasury bills offer 
only a low rate of return, individuals de- 
fer consumption (that is, save) at a suffi- 
ciently fast rate to generate average per 
capita consumption growth of around 
two percent per year. This is what Weil 
(1989) calls the risk free rate puzzle: al- 
though individuals like consumption to 
be very smooth, and although the risk 
free rate is very low, they still save 
enough that per capita consumption 
grows rapidly. 

There is now a vast literature that 
seeks to resolve these two puzzles. I be- 
gin my review of this literature by show- 
ing that the puzzles are very robust: they 
are implied by only three assumptions 
about individual behavior and asset mar- 
ket structure. First, individuals have 
preferences associated with the “stan- 
dard” utility function used in macro- 
economics: they maximize the expected 
discounted value of a stream of utilities 
generated by a power utility function. 
Second, asset markets are complete: indi- 
viduals can write insurance contracts 
against any possible contingency. Finally, 
asset trading is costless, so that taxes and 
brokerage fees are assumed to be insig- 
nificant. Any model that is to resolve the 
two puzzles must abandon at least one of 
these three assumptions. 

My survey shows that relaxing the 
three assumptions has led to plausible 
explanations for the low value of the risk 
free rate. Several alternative preference 
orderings are consistent with it; also, the 
existence of borrowing constraints push 
interest rates down in equilibrium. How- 
ever, the literature provides only two ra- 
tionalizations for the large equity pre- 
mium: either investors are highly averse 
to consumption risk or they find trading 
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stocks to be much more costly than trad- 
ing bonds. Little auxiliary evidence exists 
to support either of these explanations, 
and so (I would say) the equity premium 
puzzle is still unresolved. 

The Lucas-Breeden representative 
agent model is apparently inconsistent 
with the data in many other respects. 
Sanford Grossman and Robert Shiller 
(1981) (and many others) argue that as- 
set prices vary too much to be explained 
by the variation in dividends or in per 
capita consumption. Hansen and Ken- 
neth Singleton (1982, 1983) point out 
that in forming her portfolio, the repre- 
sentative investor appears to be ignoring 
useful information available in lagged 
consumption growth and asset returns. 
Robert Hall (1988) argues that consump- 
tion does not respond sufficiently to 
changes in expected returns. John Coch- 
rane and Hansen (1992) uncover several 
anomalies, including the so-called De- 
fault-Premium and Term Structure Puz- 
zles. 

This paper will look at none of these 
other puzzles. Instead, it focuses exclu- 
sively on the risk free rate and equity 
premium puzzles. There are two ratio- 
nalizations for limiting the discussion in 
this way. The first is simple: my conver- 
sations with other economists have con- 
vinced me that these puzzles are the 
most widely known and best under- 
stood of the variety of evidence usually 
arrayed against the representative agent 
models. 

The second rationalization is perhaps 
more controversial: I claim that these 
two puzzles have more importance for 
macroeconomists. The risk free rate 
puzzle indicates that we do not know 
why people save even when returns are 
low: thus, our models of aggregate sav- 
ings behavior are omitting some crucial 
element. The equity premium puzzle 
demonstrates that we do not know 
why individuals are so averse to the 


highly procyclical risk associated with 
stock returns. As Andrew Atkeson and 
Christopher Phelan (1994) point out, 
without this knowledge we cannot hope 
to give a meaningful answer to R. Lucas’ 
(1987) question about how costly in- 
dividuals find business cycle fluctua- 
tions in consumption growth. Thus, 
the two puzzles are signs of large gaps 
in our understanding of the macroecon- 
omy. 

Given the lack of a compelling expla- 
nation for the large equity premium, the 
article will at times read like a litany of 
failure. I should say from the outset that 
this is misleading in many ways. We have 
learned much from the equity premium 
puzzle literature about the properties of 
asset pricing models, about methods of 
estimating and testing asset pricing mod- 
els, and about methods of solving for the 
implications of asset pricing models. I 
believe that all of these contributions are 
significant. 

For better or for worse, though, this 
article is not about these innovations. In- 
stead, I focus on what might be called 
the bottom line of the work, “Do we 
know why the equity premium is so 
high? Do we know why the risk free rate 
is so low?” It is in answering the first of 
these questions that the literature falls 
short. 

The rest of the paper is structured as 
follows. The next section describes the 
two puzzles and lays out the fundamental 
modelling assumptions that generate 
them. Section III explores the potential 
for explaining the two puzzles by chang- 
ing the preferences of the representative 
agent. Section IV looks at the implica- 
tions of market frictions for asset re- 
turns. Section V concludes.! 


l In writing this paper, I have benefited greatly 
from reading four other review articles by Andrew 
Abel (1991), Aiyagari (1993), Cochrane and Han- 
sen (1992), and Heaton and D. Lucas (1995b). I 
recommend all of them highly. 
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Figure 1. Annual Real Per Capita Consumption Growth 


II. What Are the Puzzles? 
1. Aspects of the Data 


The equity premium and risk free 
rate puzzles concern the co-movements 
of three variables: the real return to 
the S & P 500, the real return to short 
term nominally risk free bonds,? and 
the growth rate of per capita real con- 
sumption (more precisely, nondurables 
and services). In this paper, I use the 
annual United States data from 1889- 
1978 originally studied by Mehra and 
Prescott (1985) (see the Appendix for 
details on the construction of the data).° 


2Ninety day Treasury bills from 1931-1978, 
Treasury certificates from 1920-31, and 60-day to 
90-day Commercial Paper prior to 1920. 

3 In what follows, I use only the Mehra-Prescott 
data set. However, it is important to realize that 
the puzzles are not peculiar to these data. For ex- 
ample, Kocherlakota (1994) shows that adding ten 
more years of data to the Mehra-Prescott series 


Plots of the data are depicted in Figures 
1-3; all of the series appear station- 
ary and ergodic (their statistical proper- 
ties do not appear to be changing over 
time). 

Table 1 contains some summary statis- 
tics for the three variables. There are 
three features of the table that give rise 
to the equity premium and risk free rate 
puzzles. First, the average real rate of re- 
turn on stocks is equal to seven percent 


does not eliminate the puzzles. Hansen and Sin- 
gleton (1983) and Aiyagari (1993) find that similar 
phenomena characterize post-World War H 
monthly data in the United States. Amlan Roy 
(1994) documents the existence of the two puzzles 
in post-World War II quarterly data in Germany 
and Japan. It may not be too strong to say that the 
equity premium and risk free rate puzzles appear 
to be a general feature of organized asset markets. 
(Jeremy Siegel, 1992, points out that the equity 
premium was much lower in the 19th century. 
However, consumption data is not available for 


this period.) 
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Figure 3. Annual Real Return to Nominally Risk Free Short Term Debt 
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TABLE 1 
SUMMARY STATISTICS 
UNITED STATES ANNUAL Data, 1889-1978 


Sample Means 


Rs 0.070 
R? 0.010 
C:/Ci-1 0.018 
Sample Variance-Covariance 

R R? Ci/Ca 
Rs 0.0274 0.00104 0.00219 
Re 0.00104 0.00308 —0.000193 
C/Cia 0.00219 —0.000193 0.00127 


In this table, C:/C;1 is real per capita consumption 
growth, R: is the real return to stocks and R? is the real 
return to Treasury bills. 


per year while the average real rate of 
return on bonds is equal to one percent 
per year. Second, by long-term historical 
standards, per capita consumption 
growth is high: around 1.8 percent per 
year. Finally, the covariance of per cap- 
ita consumption growth with stock re- 
turns is only slightly bigger than the co- 
variance of per capita consumption 
growth with bond returns. 

In some sense, there is a simple expla- 
nation for the higher average return of 
stocks: stock returns covary more with 
consumption growth than do Treasury 
bills. Investors see stocks as a poorer 
hedge against consumption risk, and so 
stocks must earn a higher average return. 
Similarly, while individuals must have 
saved a lot to generate the high con- 
sumption growth described in Table 1, 
there is an explanation of why they did 
so: as long as individuals discount the 
future at a rate lower than one percent 
per year, we should expect their con- 
sumptions to grow. The data does not 
contradict the qualitative predictions of 
economic theory; rather, as we shall see, 
it is inconsistent with the quantitative 
implications of a particular economic 
model. 


2. The Original Statement of the Puzzles 


In their 1985 paper, Mehra and 
Prescott construct a simple model that 
makes well-defined quantitative predic- 
tions for the expected values of the real 
returns to the S & P 500 and the three 
month Treasury bill rate. There are 
three critical assumptions underlying 
their model. The first is that in period 
t, all individuals have identical prefer- 
ences over future random consumption 
streams represented by the utility func- 
tion: 


E, 22.9 Bleu) A-0), a = 0 (1) 


where {c,}%, is a random consumption 
stream. In this expression, and through- 
out the remainder of the paper, E; repre- 
sents an expectation conditional on infor- 
mation available to the individual in 
period t. Thus, individuals seek to maxi- 
mize the expectation of a discounted 
flow of utility over time. 

In the formula (1), the parameter B is 
a discount factor that households apply 
to the utility derived from future con- 
sumption; increasing B leads investors to 
save more. In contrast, increasing œŒ has 
two seemingly distinct implications for 
individual attitudes toward consumption 
profiles. When @ is large, individuals 
want consumption in different states to 
be highly similar: they dislike risk. But 
individuals also want consumption in dif- 
ferent dates to be similar: they dislike 
growth in their consumption profiles. 

The second key assumption of the 
Mehra-Prescott model is that people can 
trade stocks and bonds in a frictionless 
market. What this means is that an indi- 
vidual can costlessly (that is, without sig- 
nificant taxes or brokerage fees) buy and 
sell any amount of the two financial as- 
sets. In equilibrium, an individual must 
not be able to gain utility at the margin 
by selling bonds and then investing the 
proceeds in stocks; the individual should 
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not be able to gain utility by buying or 
selling bonds. Hence, given the costless- 
ness of performing these transactions, an 
individual’s consumption profile must 
satisfy the following two first order con- 
ditions: 


E Alene) “(Ri — RP..)} =0. (2a) 


BE, MCCn) Rpa) = 1. (2b) 


Intheseconditions,R¿ is the gross return 
to stocks from period (t — 1) to period t, 
while R? is the gross return to bonds 
from period (t — 1) to period t. 

These first order conditions impose 
statistical restrictions on the comove- 
ment between any person’s pattern of 
consumption and asset returns. The third 
critical feature of Mehra and Prescott’s 
analysis, though, is that they assume the 
existence of a “representative” agent. Ac- 
cording to this assumption, the above 
conditions (2a) and (2b) are satisfied not 
just for each individual’s consumption, 
but also for per capita consumption. 

There is some confusion about what 
kinds of assumptions underlie this substi- 
tution of per capita for individual con- 
sumption. It is clear that if all individuals 
are identical in preferences and_ their 
ownership of production opportunities, 
then in equilibrium, all individuals will 
in fact consume the same amount; thus, 
under these conditions, substituting per 
capita consumption into conditions (2a) 
and (2b) is justified. Many economists 
distrust representative agent models be- 
cause they believe this degree of homo- 
geneity is unrealistic. 

However, while this degree of homo- 
geneity among individuals is sufficient to 
guarantee the existence of a repre- 
sentative agent, it is by no means neces- 
sary. In particular, George Constan- 
tinides (1982) shows that even if 
individuals are heterogeneous in prefer- 
ences and levels of wealth, it may be pos- 
sible to find some utility function for the 


representative individual (with coeffi- 
cient of relative risk aversion no larger 
than the most risk averse individual and 
no smaller than the least risk averse indi- 
vidual) which satisfies the first order 
conditions (2a) and (2b). The key is that 
asset markets must be complete: indi- 
viduals in the United States must have a 
sufficiently large set of assets available 
for trade that they can diversify any idio- 
syncratic risk in consumption. When 
markets are complete, we can construct 
a “representative” agent because after 
trading in complete markets, individuals 
become marginally homogeneous even 
though they are initially heterogeneous. 

We can summarize this discussion as 
follows: Mehra and Prescott assume that 
asset markets are frictionless, that asset 
markets are complete, and that the resul- 
tant representative individual has prefer- 
ences of the form given by (1). However, 
Mehra and Prescott also make three 
other, more technical, assumptions. 
First, they assume that per capita con- 
sumption growth follows a two state 
Markov chain constructed in such a way 
that the population mean, variance, and 
autocorrelation of consumption growth 
are equivalent to their corresponding 
sample means in United States data. 
They also assume that in period t, the 
only variables that individuals know are 
the realizations of current and past con- 
sumption growth. Finally, they assume 
that the growth rate of the total divi- 
dends paid by the stocks included in the 
S & P 500 is perfectly correlated with 
the growth rate of per capita consump- 
tion, and that the real return to the 
(nominally risk free) Treasury bill is per- 
fectly correlated with the return to a 
bond that is risk free in real terms. 

These statistical assumptions allow 
Mehra and Prescott to use the first order 
conditions (2a) and (2b) to obtain an 
analytical formula that expresses the 
population mean of the real return to the 
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S & P 500 and the population mean real 
return to the three month Treasury bill 
in terms of the two preference parame- 
ters B and a. Using evidence from mi- 
croeconometric data and introspection, 
they restrict B to lie between 0 and 1, 
and & to lie between 0 and 10. Their 
main finding is that for any value of the 
preference parameters such that the ex- 
pected real return to the Treasury bill is 
less than four percent, the difference in 
the two expected real returns (the “eq- 
uity premium”) is less than 0.35 percent. 
This stands in contrast to the facts de- 
scribed earlier that the average real re- 
turn to Treasury bills in the United 
States data is one percent while the aver- 
age real return to stocks is nearly six per- 
cent higher. 

Mehra and Prescott conclude from 
their analysis that their model of asset 
returns is inconsistent with United States 
data on consumption and asset returns. 
Because the model was constructed by 
making several different assumptions, 
presumably the fit of the model to the 
data could be improved by changing any 
of them. In the next section, I restate 
the equity premium puzzle to show 
why the three final assumptions about 
the statistical behavior of consumption 
and asset returns are relatively unimpor- 
tant. 


3. A More Robust Restatement of the 
Puzzles 


As we saw above, the crux of the 
Mehra-Prescott model is that the first 
order conditions (2a) and (2b) must be 
satisfied with per capita consumption 
growth substituted in for individual con- 
sumption growth. Using the Law of Iter- 
ated Expectations, we can replace the 
conditional expectation in (2a) and (2b) 
with an unconditional expectation so 
that: 


ECC (Bia Rba) =0 (2a) 


BE((Cj4/C,)-*R?, 1} = 1. (2b’) 


(The variable C; stands for per capita 
consumption.) We can estimate the ex- 
pectations or population means on the 


left hand side of (2a’) and (2b’) using the 


sample means of: 
Cf = (Cy/C)- (RE — RE} 


eb. = B((Cy/C,) RP, 1} 

given a sufficiently long time series of 
data on C41 and the asset returns.4 

Table 2 reports the sample mean of ej 
for various values of a; for all values of a 
less than or equal to 8.5, the sample 
mean of ej is significantly positive. We 
can conclude that the population mean 
of ej is in fact positive for any value of a 
less than or equal to 8.5; hence, for any 
such a, the representative agent can gain 
at the margin by borrowing at the Treas- 
ury bill rate and investing in stocks.° This 


4Throughout this paper, I will be evaluating 
various representative agent models by using ver- 
sions of (2a’) and (2b’) as opposed to (2a) and 
(2b). While (2a,b) certainly imply (2a’,b’), there 
are a host of other implications of (2a,b): the law 
of iterated expectations “averages” over all of the 
implications of (2a) and (2b) to arrive at (2a’) and 
(2b’). Focusing on (2a) and (2b) is in keeping with 
my announced goal in the introduction of just 
looking at the equity premium and risk free rate 
puzzles. 

Hansen and Jagannathan (1991) and Cochrane 
and Hansen (1992) reinterpret the equity pre- 
mium puzzle and the risk free rate puzzles using 
the variance bounds derived in the former paper. 
Note that, as Cochrane and Hansen (1992) empha- 
size, the variance bound is an even weaker impli- 
cation of (2a) than (2a’) is (in the sense that (2a’) 
implies the variance bound while the converse is 
not true). 

5 Breeden, Michael Gibbons, and Litzenberger 
(1989) and Grossman, Angelo Melino, and Shiller 
(1987) point out that the first order conditions 
(2a) and (2b) are based on the assumption that all 
consumption within a given year is perfectly sub- 
stitutable; if this assumption is false, there is the 
possibility of time aggregation bias. However, 
Hansen, Heaton, and Amir Yaron (1994) show that 
as long as the geometric average of consumption 
growth rates is a good approximation to the arith- 
metic average of consumption growth rates, the 
presence of time aggregation should only affect 
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TABLE 2 
THE EQUITY PREMIUM PUZZLE 


a e t-stat 
0.0 0.0594 3.345 
0.5 0.0577 3.260 
1.0 0.0560 3.173 
1.5 0.0544 3.082 
2.0 0.0528 2.987 
2.5 0.0512 2.890 
3.0 0.0496 2.790 
3.5 0.0480 2.688 
4.0 0.0464 2.584 
4.5 0.0449 2.478 
5.0 0.0433 2.370 
5.5 0.0418 2.262 
6.0 0.0403 2.153 
6.5 0.0390 2.044 
7.0 0.0372 1.934 
7.5 0.0357 1.824 
8.0 0.0341 1.715 
8.5 0.0326 1.607 
9.0 0.0310 1.501 
9.5 0.0295 1.395 
10.0 0.0279 1.291 


In this table, @ is the sample mean of e = (Cr1/C)* 
(Rs,, — Ra) and & is the coefficient of relative risk 
aversion. Standard errors are calculated using the 
implication of the theory that e: is uncorrelated with erx 
for all k; however, they are little changed by allowing e: 
to be MA(1) instead. This latter approach to calculating 
standard errors allows for the possibility of time 
aggregation (see Hansen, Heaton, and Yaron 1994). 


is the equity premium puzzle. Intui- 
tively, while the covariance between 
stock returns and per capita consump- 
tion growth is positive (see Table 1), it is 
not sufficiently large to deter the repre- 
sentative investor with a coefficient of 
relative risk aversion less than 8.5 from 
wanting to borrow and invest in stocks. 
Note that the marginal benefit of selling 
Treasury bills and buying stocks falls as 


Tables 2-3 through the calculation of the standard 
errors. Using their suggested correction makes 
little difference to the f-statistics reported in 
either Table. (Indeed, Corr(e;, e;.4) is small for all 
k<10.) 


the investors degree of risk aversion 
rises.6 

Table 3 reports the sample mean of ep 
for various values of a and B = 0.99. It 
shows that for B equal to 0.99 and o 
greater than one, the representative 
household can gain at the margin by 
transferring consumption from the fu- 
ture to the present (that is, reducing its 
savings rate). This is the risk free rate 
puzzle. High values of œ imply that indi- 
viduals view consumption in different 
periods as complementary. Such indi- 
viduals find an upwardly sloped con- 
sumption profile less desirable because 
consumption is not the same in every pe- 
riod of life. As a result, an individual 
with a high value of & realizes more of a 
utility gain by reducing her savings rate. 
Note that the risk free rate puzzle comes 
from the equity premium puzzle: there is 
a risk free rate puzzle only if @ is re- 
quired to be larger than one so as to 
match up with the high equity premium.” 


6 This intuition becomes even more clear if one 
assumes that In(Ci41/C;), In(R$,;), and In(R?,,) are 
jointly normally distributed. Then, equations (2a) 
and (2b) become: 


E(rt — rt) — aCov(grrit-r?) 
+0.5Var(r}) + 0.5Var(r?) =0 (2a) 
In(B) — WE(g:) + E(r?) + 0.5(02Var(gr) 
— 2aCov(gr,r?) + Var(r?)} =0 (2B) 


where (gs r$, r?) = (In(C/C;-1), In(R¥), In(R?)). In the 
data, Cov(g:,r7) is essentially zero. Because 
E(ri—r?) is so large, either & or Cov(g,r7) must be 
large in order to satisfy (2a). 

As with (2a) and (2b), the sample analogs of (2a) 
and (2b) can only be satisfied by setting B equal to 
a value larger than one and @ equal to a value 
greater than 15. See N. Gregory Mankiw and 
Stephen Zeldes (1991) for more details. 

7 See Edward Allen (1990), Cochrane and Han- 
sen (1992), and Constantinides (1990) for similar 
presentations of the two Pe 

Stephen Cecchetti, Pok-sang Lam, and Nelson 
Mark (1993) present the results of joint tests of 
(2a’,b’). I present separate t-tests because I think 
doing so provides more intuition about the failings 
of the representative agent model. Of course, 
there is nothing wrong statistically with doing two 
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TABLE 3 
THE RISK FREE RATE PUZZLE 


a e t-stat 
0.0 0.0033 0.0567 
0.5 —0.0081 —1.296 
1.0 —0.0162 —2.233 
1.5 —0.0239 —2.790 
2.0 —0.0313 -3.100 
2.5 —0.0382 -3.263 
3.0 —0.0448 -3.339 
3.5 —0.0510 -3.360 
4.0 —0.0569 -3.348 
4.5 —0.0624 -3.312 
5.0 —0.0675 -3.259 
5.5 —0.0723 -3.195 
6.0 —0.0768 -3.123 
6.5 —0.0808 -3.043 
7.0 —0.0846 -2.959 
7.5 —0.0880 —2.871 
8.0 —0.0910 —2.779 
8.5 —0.0937 -2.685 
9.0 —0.0960 -2.590 
9.5 —0.0980 —2.492 
10.0 —0.0997 —2.394 


In this table, Z is the sample mean of e, =B(cm1/ci)* 
(Rẹ? — 1) and & is the coefficient of relative risk 
aversion. The discount factor B is set equal to 0.99. The 
standard errors are calculated using the implication of 
the theory that e, is uncorrelated with ek for all k; 
however, they are little changed by allowing e to be 
MA(1) instead. This latter approach to calculating 
standard errors allows for the possibility of time 
aggregation (see Hansen, Heaton, and Yaron 1994). 


It is possible to find parameter set- 
tings for the discount factor B and the 
coefficient of relative risk aversion o that 
exactly satisfy the sample versions of 
equations (2a’) and (2b’). In particular, 
by setting & = 17.95, it is possible to sat- 
isfy the equity premium first order con- 
dition (2a’); by also setting B = 1.08, it is 
possible to satisfy the risk free rate first 


separate t-tests, each with size a, as long as one 
keeps in mind that the size of the overall test 
aula be as low as zero or it could be as high as 
2a. (There may be a loss of power associated with 
using sequential t-tests but power does not seem to 
be an issue in tests of representative agent models!) 


order condition (2b’). (See Kocherlakota, 
1990a, for a demonstration of how com- 
petitive equilibria can exist even when B 
> 1.) It is necessary to set @ to such a 
large value because stocks offer a huge 
premium over bonds, and aggregate con- 
sumption growth does not covary greatly 
with stock returns; hence, the repre- 
sentative investor can be marginally in- 
different between stocks and bonds only 
if he is highly averse to consumption 
risk. Similarly, the high consumption 
growth enjoyed by the United States 
since 1890 can be consistent with high 
values of & and the risk free rate only if 
the representative investor is so patient 
that his “discount” factor B is greater 
than one. 

Thus, the equity premium and risk 
free rate puzzles are solely a product of 
the parametric restrictions imposed by 
Mehra and Prescott on the discount fac- 
tor B and the coefficient of relative risk 
aversion a. Given this, it is important to 
understand the sources of these restric- 
tions. The restriction that the discount 
factor B is less than one emerges from 
introspection: most economists, includ- 
ing Mehra and Prescott, believe that an 
individual faced with a constant con- 
sumption stream would, on the margin, 
like to transfer some consumption from 
the future to the present. 

The restriction that œ should be less 
than ten is more controversial. Mehra 
and Prescott (1985) quote several micro- 
econometric estimates that bound «a 
from above by three. Unfortunately, the 
only estimate that they cite from finan- 
cial market data has been shown to be 
severely biased downwards (Kocherlak- 
ota 1990c). In terms of introspection, it 
has been argued by Mankiw and Zeldes 
(1991) that an individual with a coeffi- 
cient of relative risk aversion above ten 
would be willing to pay unrealistically 
large amounts to avoid bets. However, 
they consider only extremely large bets 
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(ones with potential losses of 50 percent 
of the gambler’s wealth). In contrast, 
Shmuel Kandel and Robert Stambaugh 
(1991) show that even values of & as high 
as 30 imply quite reasonable behavior 
when the bet involves a maximal poten- 
tial loss of around one percent of the 
gambler’s wealth. 

Because of the arguments offered by 
Kandel and Stambaugh (1991) and Ko- 
cherlakota (1990c), some economists 
(see, among others, Craig Burnside 
1994; Campbell and Cochrane 1995; 
Cecchetti and Mark 1990; Cecchetti, 
Lam, and Mark 1993; and Hansen, 
Thomas Sargent, and Thomas Tallarini 
1994) believe that there is no equity pre- 
mium puzzle: individuals are more risk 
averse than we thought, and this high de- 
gree of risk aversion is reflected in the 
spread between stocks and bonds. How- 
ever, it is clear to me from conversations 
and from knowledge of their work that a 
vast majority of economists believe that 
values for œ above ten (or, for that mat- 
ter, above five) imply highly implausible 
behavior on the part of individuals. (For 
example, Mehra and Prescott (1985, 
1988) clearly chose an upper bound as 
large as ten merely as a rhetorical flour- 
ish.) D. Lucas (1994, p. 335) claims that 
any proposed solution that “does not ex- 
plain the premium for@<2.5...is... 
likely to be widely viewed as a resolution 
that depends on a high degree of risk 
aversion.” She is probably not exaggerat- 
ing the current state of professional 
opinion. 

Tables 2 and 3 show that any model 
that leads to (2a’,b’) will generate the 
two puzzles. This helps make clear what 
the puzzles are not about. For example, 
while Mehra and Prescott ignore sam- 
pling error in much of their discussion, 
Tables 2 and 3 do not. This allays the 
concerns expressed by Allan Gregory and 
Gregor Smith (1991) and Cecchetti, 
Lam, and Mark (1993) that the puzzles 


could simply be a result of sampling er- 
ror. Similarly, Mehra and Prescott as- 
sume that the growth rate of the divi- 
dend to the S & P 500 is perfectly 
correlated with the growth rate of per 
capita consumption. Simon Benninga 
and Aris Protopapadakis (1990) argue 
that this might be responsible for the 
conflict between the model and the data. 
However, in Tables 2 and 3, the equity 
premium and risk free rate puzzles are 
still present when the representative in- 
vestor is faced with the real return to the 
S & P 500 itself, and not some imaginary 
portfolio with a dividend that is perfectly 
correlated with consumption. 

Finally, we can see that the exact na- 
ture of the process generating consump- 
tion growth is irrelevant. For example, 
Cecchetti, Lam, and Mark (1993), and 
Kandel and Stambaugh (1990) have both 
proposed that allowing consumption 
growth to follow a Markov switching pro- 
cess (as described by James Hamilton 
1989) might explain the two puzzles. Ta- 
bles 2 and 3, though, show that this con- 
jecture is erroneous:® the standard t-test 
is asymptotically valid when consumption 
growth follows any stationary and er- 
godic process, including one that is 
Markov switching.’ 


8 See Abel (1994) for another argument along 
these lines. 

9 Some readers might worry that the justifica- 
tion for Tables 2-3 is entirely asymptotic. I am 
sympathetic to this concern. I have run a Monte 
Carlo experiment in which the pricing errors et 
and ep are assumed to be i.i.d. over time and have 
a joint distribution equal to their empirical joint 
distribution. In this setting, the Monte Carlo re- 
sults show that the asymptotic distribution of the 
t- statistic and its distribution with 100 observa- 
tions are about the same. 

However, there is a limit to the generality of 
Tables 2 and 3. One could posit, as in Kocher- 
lakota (1994) and Thomas Rietz (1988), that there 
is a low probability of consumption falling by an 
amount that has never been seen in United States 
data. With power utility, investors are very averse 
to these large falls in consumption, even if they 
are highly unlikely. Hence, investors believe that 
stocks are riskier and bonds are more valuable 
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Thus, Tables 2 and 3 show that there 
are really only three crucial assumptions 
generating the two puzzles. First, asset 
markets are complete. Second, asset 
markets are frictionless. (As discussed 
above, these two assumptions imply that 
there is a representative consumer.) Fi- 
nally, the representative consumer’s util- 
ity function has the power form assumed 
by Mehra and Prescott (1985) and satis- 
fies their parametric restrictions on the 
discount factor and coefficient of relative 
risk aversion. Any model that seeks to re- 
solve the two puzzles must weaken at 
least one of these three assumptions. 


III. Preference Modifications 


The paradigm of complete and fric- 
tionless asset markets underlies some of 
the fundamental insights in both finance 
and economics. (For example, the 
Modigliani-Miller Theorems and the ap- 
peal of profit maximization as an objec- 
tive for firms both depend on the com- 
pleteness of asset markets.) Hence, it is 
important to see whether it is possible to 
explain the two puzzles without aban- 
doning this useful framework. To do so, 
we must consider possible alterations in 
the preferences of the representative in- 
dividual; in this section, I consider three 
different modifications to the prefer- 
ences (1): generalized expected utility, 
habit formation, and relative consump- 
tion effects. 


1. Generalized Expected Utility 


In their (1989) and (1991) articles,!° 
Larry Epstein and Stanley Zin describe a 


than appears to be true when one looks at infor- 
mation in the 90-year United States sample; the 
t-statistics are biased upwards. While this explana- 
tion is consistent with the facts, it has the teres 
of being largely nontestable using only returns and 
consumption dee (See Mehra and Prescott, 1988, 
for another critique of this argument.) 

10 Weil (1989), Kandel and Stambaugh (1991), 
and Hansen, Sargent, and Tallarini (1994) provide 


generalization of the “standard” prefer- 
ence class (1). In these Generalized Ex- 
pected Utility preferences, the period t 
utility U, received by a consumer from a 
stream of consumption is described re- 
cursively using the formula: 


U= {er + BE unr JOPO) (3) 


Thus, utility today is a constant elasticity 
function of current consumption and fu- 
ture utility. Note that (1) can be ob- 
tained as a special case of these prefer- 
ences by setting @ = p. 

Epstein and Zin point out a crucial 
attribute of these preferences. In the 
preferences (1), the coefficient of rela- 
tive risk aversion @ is constrained to be 
equal to the reciprocal of the elasticity 
of intertemporal substitution. Thus, 
highly risk averse consumers must view 
consumption in different time periods 
as being highly complementary. This is 
not true of the GEU preferences (3). In 
this utility function, the degree of risk 
aversion of the consumer is governed 
by a while the elasticity of inter- 
temporal substitution is governed by 
1/p. Epstein and Zin (1991) argue that 
disentangling risk aversion and inter- 
temporal substitution in this fashion 
may help explain various aspects of as- 
set pricing behavior that appear anoma- 
lous in the context of the preferences 
(1). 

To see the usefulness of GEU prefer- 
ences in understanding the equity pre- 
mium puzzle and the risk free rate puz- 
zle, suppose there is a representative 
investor with preferences given by (3) 
who can invest in stocks and bonds. 
Then, the investor’s optimal consump- 


complementary analyses of the implications of this 
type of generalized expected utility for asset pric- 
ing. 

“There are other ways to generalize expected 
utility so as to generate interesting implications for 
asset prices—see Epstein and Tan Wang (1994) 
and Epstein and Zin (1990) for examples. 
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tion profile must satisfy the two first or- 
der conditions: 


EAUP GHC y41/C) PRE — RP)} =0 (4a) 


BE H(E, UF Py) (O-py.—-a) 
UP (C141/C,) PRP} = 1. (4b) 


Note that the marginal rates of substitu- 
tion depend on a variable that is funda- 
mentally unobservable: the period (t + 1) 
level of utility of the representative 
agent (and the period t expectations of 
that utility). This unobservability makes 
it difficult to verify whether the data are 
consistent with these conditions. 

To get around this problem, we can 
exploit the fact that it is difficult to pre- 
dict future consumption growth using 
currently available information (see Hall 
1978). Hence, I assume in the discussion 
of GEU preferences that future con- 
sumption growth is statistically indepen- 
dent of all information available to the 
investor today.!! Under this restriction, 
it is possible to show that utility in pe- 
riod (¢ + 1) is a time and state invariant 
multiple of consumption in period (t + 
1). We can then apply the Law of Iter- 
ated Expectations to (4a) and (4b) to ar- 
rive at: 


BECC aC (Ri, — Re} =9  (4a’) 
BLE(C,4/C,)1-#] Pv) 
E((Cy/C,)-°R?,,} = 1. (4b’) 


See Kocherlakota (1990b) for a precise 
derivation of these first order condi- 
tions. 12 


11 This assumption of serial independence is 
also emplo ed by Abel (1990), Epstein and Zin 
(1990), td Campbell and Cochrane (1995). Kan- 
a and aa (1991), Kocherlakota (1990b), 
and Weil (1989) have shown that the thrust of the 
discussion that follows is little affected by allowing 
for more realistic amounts of dependence in con- 
sumption growth. 

12 Epstein and Zin (1991) derive a different rep- 
resentation for the first order conditions (4a) and 
(4b) of the representative investor: 


The first order condition (4a’) tells us 
that using GEU preferences will not 
change Table 2: for each value of a, the 
marginal utility gain of reducing bond 
holdings and investing in stocks is the 
same whether the preferences lie in the 
expected utility class (1) or in the gener- 
alized expected utility class (3). In con- 
trast, Table 4 shows that it is possible to 
resolve the risk free rate puzzle using 
the GEU preferences. In particular, for 
B = 0.99, and various values of o in the 
set (0, 18], Table 4 presents the value of 
p that exactly satisfies the first order 
condition (4b). 

The intuition behind these results is 
simple. The main benefit of modelling 
preferences using (3) instead of (1) is 
that individual attitudes toward risk and 
growth are no longer governed by the 
same parameter. However, the equity 
premium puzzle arises only because of 
economists’ prior beliefs about risk aver- 
sion; hence, this puzzle cannot be re- 
solved by disentangling attitudes toward 
risk and growth. On the other hand, the 
connection between risk aversion and in- 
tertemporal substitution in the standard 
preferences (1) is the essential element 


ECHC (RP) Rin — Ress)}=0 (x) 
EAB CH CN PREDORA} + 1 (ax) 


where y = (1 — a@)/(1 — p). In (*) and (**), R? is 
the gross real return to the representative inves- 
tors entire portfolio of assets (including human 
capital, housing, etc.). This ap aa has the 
desirable attribute of being valid regardless of the 
information set of the representative investor (so 
we don’t have to assume that consumption growth 
from period t to period (t + 1) is independent of 
all information available to the investor in period 
t). Unfortunately, the variable Rọ’ is not observ- 
able. Epstein and Zin (1991) use the value- 
weighted return to the NYSE as a proxy, but of 
course this understates the true level of diversifi- 
cation of the representative investor and (poten- 
tially) greatly overstates the covariability of her 
marginal rate of substitution with the return to the 
stock market. As a result, using this proxy variable 
leads one to the spurious conclusion that GEU 
preferences can resolve the equity premium puz- 
zle with low levels of risk aversion. 
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TABLE 4 
RESOLVING THE RISK FREE RATE PUZZLE 
WITH GEU PREFERENCES 


a V/a 1/p 
0.500 2.000 23.93 
1.000 1.000 14.91 
1.500 0.6667 10.62 
2.000 0.5000 8.126 
2.500 0.4000 6.493 
3.000 0.3333 5.345 
3.500 0.2857 4.494 
4.000 0.2500 3.840 
4.500 0.2222 3.322 
5.000 0.2000 2.903 
5.500 0.1818 2.557 
6.000 0.1667 2.267 
6.500 0.1538 2.022 
7.000 0.1429 1.811 
7.500 0.1333 1.629 
8.000 0.1250 1.469 
8.500 0.1176 1.330 
9.000 0.1111 1.206 
9.500 0.1053 1.096 

10.00 0.1000 0.9974 
10.50 0.09524 0.9091 
11.00 0.09091 0.8295 
11.50 0.08696 0.7574 
12.00 0.08333 0.6920 
12.50 0.08000 0.6325 
13.00 0.07692 0.5781 
13.50 0.07407 0.5283 
14.00 0.07143 0.4826 
14.50 0.06897 0.4406 
15.00 0.06667 0.4018 
15.50 0.06452 0.3660 
16.00 0.06250 0.3329 
16.50 0.06061 0.3022 
17.00 0.05882 0.2737 
17.50 0.05714 0.2472 
18.00 0.05556 0.2226 


In this table, the first column is the coefficient of 
relative risk aversion a, the second column is the 
corresponding elasticity of intertemporal substitution if 
preferences are of the form (1), and the third column is 
the value of 1/p that satisfies the risk free rate first 
order condition (4b’) given that preferences are of the 
form (3), B = 0.99 and @ has the value specified in the 
first column. 


of the risk free rate puzzle. To match 
the equity premium, risk aversion has to 
be high. But in the standard prefer- 


ences (1), this forces intertemporal 
substitution to be low. The GEU prefer- 
ences can explain the risk free rate 
puzzle by allowing intertemporal substi- 
tution and risk aversion to be high simul- 
taneously.!3 


2. Habit Formation 


We have seen that the risk free rate 
puzzle is a consequence of the demand 
for savings being too low when individu- 
als are highly risk averse and have pref- 
erences of the form (1). The GEU pref- 
erence class represents one way to 
generate high savings demand when indi- 
viduals are highly risk averse. There is 
another, perhaps more intuitive, ap- 
proach. The standard preferences (1) as- 
sume that the level of consumption in 
period (t — 1) does not affect the mar- 
ginal utility of consumption in period t. 
It may be more natural to think that an 
individual who consumes a lot in period 
(t — 1) will get used to that high level of 
consumption, and will more strongly 
desire consumption in period t; mathe- 
matically, the individual’s marginal util- 
ity of consumption in period ¢ is an 
increasing function of period (t —1) con- 
sumption. 

This property of intertemporal prefer- 
ences is termed habit formation. The ba- 
sic implications of habit formation for 
asset returns (as explained by Constan- 


13Weil (1989) emphasizes that GEU prefer- 
ences cannot explain the risk free rate puzzle if 
one forces the coefficient of relative risk aversion 
to be “reasonable” (say, approximately 1). He 
shows that under this restriction, then the elastic- 
ity of intertemporal substitution must be very 
large to be consistent with the risk free rate. (See 
Table 4 for a confirmation.) This large elasticity of 
intertemporal substitution appears to contradict 
the results of Hall (1988), who shows that the esti- 
mated slope coefficient is very small in a regres- 
sion of consumption growth on expected real in- 
terest rates (although Weil, 1990, points out that it 
is difficult to see a direct linkage between Hall’s 
regression coefficients and the parameters that de- 
scribe the GEU preferences). 
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tinides 1990; and Heaton 1995) can be 
captured in the following simple model. 
Suppose that there is a representative 
agent with preferences in period t repre- 
sented by the utility function: 


E; X50 B' (Crs = owen T° — &),A > 0. (5) 


Thus, the agent’s momentary utility is a 
decreasing function of last period’s con- 
sumption: it takes more consumption to- 
day to make him happy if he consumed 
more yesterday. 

The individual’s marginal utility with 
respect to period t consumption is given 


by the formula: 
MU, = (c; — Mci)” — BAE Cni — Ae). (6) 


Thus, the marginal utility of period t 
consumption is an increasing function of 
period (t — 1) consumption. Note that 
there are two pieces to this formula for 
marginal utility. The first term captures 
the fact that if I buy a BMW rather than 
a Yugo today, then I am certainly better 
off today; however, the second piece 
models the notion that having the BMW 
“spoils” me and reduces my utility from 
all future car purchases. 

The individual’s optimal consumption 
portfolio must satisfy the two first order 
conditions that make him indifferent to 
buying or selling more stocks or bonds: 


BE,{(MU,,.,/MU,)(R§,, — Rby)}=0 (7a) 


BE{(MU,,;/MU,R?,1)} =1 (7b) 


where MU; is defined as in (6). We can 
apply the Law of Iterated Expectations 
to obtain: 


BE((MU,4:/MU,)(Rtsa — RP) = 0 (7a’) 
BE{(MU,,,/MU,)R?,, b=], (7b’) 
The problem with trying to satisfy 

(7a’) and (7b’) is that MU, depends on 

the investor’s ability to predict future 

consumption growth. As with GEU pref- 


erences, this means that the formula for 
the marginal utility depends on the indi- 
vidual’s (possibly unobservable) informa- 
tion. As in that context, it is convenient 
to assume that consumption growth from 
period t to period (t + 1) is unpre- 
dictable. Then the ratio of marginal utili- 
ties is given by: 


MU,„/MU, = (Q19: — AQ) 
- ABCP CECE — A) 
{(@, -A — ABCE — AY} (8) 


where = C/C;-1. This allows us to eas- 
ily estimate MU;+1/M U; by using the sam- 
ple mean to form an estimate of the un- 
conditional expectations in the formula 
(8). 

With the preference class (1), it is not 
possible to simultaneously satisfy equa- 
tions (2a’) and (2b’) without driving B 
above one. This is not true of prefer- 
ences that exhibit habit formation. For 
example, suppose we set B = 0.99. Un- 
like the preference class (1), it is then 
possible to find preference parameters to 
satisfy both (7a’) and (7b’). In particular, 
if we set œ = 15.384 and A = 0.174, then 
the sample analogs of both first order 
conditions are exactly satisfied in the 
data.14 


14 Instead of (7a’,b’), Wayne Ferson and Con- 
stantinides (1991) work with an alternative impli- 
cation of (7a,b): 


E{MU:Z/(Ci-AC1-1)*)} 
= BE{M Um R1 ZA(C AC- 


where R;+1 is the gross return to an arbitrary asset, 
and Zr is an arbitrary variable observable to the 
econometrician and to the agent at time t. This 
condition has the desirable property that it does 
not involve terms that are unobservable to the 
econometrician. However, (*) is a different type of 
implication of optimal behavior from (4a,b) or 
(2a,b) because it is not just an averaged version of 
(7a,b) (even when Zz is set equal to 1). In order to 
be consistent across preference orderings, I work 
with (7a,b). 

In the context of a calibrated general equilib- 
rium model, Constantinides (1990) generates a 
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There are two implications of this 
result. First, habit formation does not 
resolve the equity premium puzzle: it 
is still true that the representative in- 
vestor is indifferent between stocks and 
bonds only if she is highly averse to 
consumption risk. On the other hand, 
habit formation does help resolve the 
risk free rate puzzle—it is possible to 
satisfy (7a’) and (7b’) without setting B 
larger than one. The intuition behind 
this finding is simple. For any given level 
of current consumption, the individual 
knows that his desire for consumption 
will be higher in the future because con- 
sumption is habit forming; hence, the in- 
dividual’s demand for savings increases 
relative to the preference specification 


(1). 


3. Relative Consumption: “Keeping 
up with the Joneses” 


The standard preferences (1) assume 
that individuals derive utility only from 
their own consumption. Suppose, 
though, that as James Duesenberry 
(1949) posits, an individual’s utility is a 
function not just of his own consumption 
but of societal levels of consumption. 
Then, his investment decisions will be 
affected not just by his attitudes toward 
his own consumption risk, but also by his 
attitudes toward variability in societal 
consumption. Following up on this intui- 
tion, Abel (1990) and Jordi Gali (1994) 
examine the asset pricing implications of 
various classes of preferences in which 
the utility an individual derives from a 


large equity premium and a low risk free rate by 
assuming that the eee agent’s_ utility 
function has a large value of A and a low value of 
a. However, it is important to keep in mind that 
when A is large, the individual requires a large 
level of consumption in order to just survive; he 
will pay a lot to avoid small consumption gambles 
even if æ is low. Thus, Constantinides’ proposed 
resolution of the puzzles requires individuals to be 
very averse to ae risk (although, as he 
shows, not to wealth risk). 


given amount of consumption depends 
on per capita consumption.!° 

I will work with a model that combines 
features of Abel’s and Gals formula- 
tions. Suppose the representative indi- 
vidual has preferences in period t: 


E, ELOR chech Ch 3/00), a20 (9) 


where cus is the individual’s level of con- 
sumption in period (t + s) while Cs is 
the level of per capita consumption in 
period (t + s) (in equilibrium, of course, 
the two are the same). Thus, the individ- 
ual derives utility from how well she is 
doing today relative to how well the av- 
erage person is doing today and how well 
the average person did last period. If the 
individual is a jealous sort, then it is 
natural to think of y and A as being nega- 
tive: the individual is unhappy when oth- 
ers are doing well. If the individual is 
patriotic, then it is natural to think of y 
and A as being positive: the individual is 
happy when per capita consumption for 
the nation is high. 

The representative individual treats 
per capita consumption as exogenous 
when choosing how much to invest in 
stocks and bonds. Hence, in equilibrium, 
the individual’s first order conditions 
take the form: 


E {(Cr/C)e(C/C,)( Ris 


-Rb ı)}=0 (10a) 


BE A(Ci1/C)* C/C -)Rb a} =1. (10b) 


We can apply the law of iterated expec- 
tations to derive the two conditions: 


E{(Cy/C) UC /C -DMR 


- RP,)}=0 (108) 


BE((Cy41/C,)* (CYC) R24} = 1. (10b’) 

15See Campbell and Cochrane (1995) and 
James Nason (1988) for models which feature both 
relative consumption effects and time-varying risk 
aversion. 
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For any specification of the discount 
factor B and the coefficient of relative 
risk aversion Q, it is possible to find set- 
tings for y and A such that the puzzles 
are resolved in that the sample analogs 
of (10a’) and (10b’) are satisfied. For ex- 
ample, suppose B = 0.99. Then, if (a — y) 
= 19.280, and À = 3.813, the sample ana- 
logs of (10a’) and (10b’) are satisfied ex- 
actly. 

This model offers the following expla- 
nation for the seemingly large equity 
premium. When y is large in absolute 
value, an individual’s marginal utility of 
own consumption is highly sensitive to 
fluctuations in per capita consumption 
and therefore strongly negatively related 
to stock returns. Thus, even if @ is small 
so that the “representative” investor is 
not all that averse to individual consump- 
tion risk, she does not find stocks attrac- 
tive because she is highly averse to per 
capita consumption risk. 

However, the presence of contempora- 
neous per capita consumption in the rep- 
resentative investor’s utility function 
does not help explain the risk free rate 
puzzle: if A is set equal to zero, (10a’) 
and (10b’) become equivalent to (2a’) 
and (2b’). We know from our analysis of 
those equations that if A = 0, it is neces- 
sary to drive B larger than one in order 
to satisfy (10b’) given that (10a’) is satis- 
fied. The positive effect of lagged per 
capita consumption on current marginal 
utility increases the individual’s demand 
for savings and allows (10a’) to be satis- 
fied with a relatively low value of B. 


4. Summary 


Mehra and Prescott (1985) require the 
preferences of the representative indi- 
vidual to lie in the “standard” class (1). 
We have seen that these preferences can 
be made consistent with the large equity 
premium only if the coefficient of rela- 
tive risk aversion is pushed near 20. 
Given the low level of the average Trea- 


sury bill return, individuals who see con- 
sumption in different periods as so com- 
plementary will be happy with the steep 
consumption profile seen in United 
States data only if their “discount” factor 
is above one. 

Several researchers have explored the 
consequences of using broader classes of 
preferences. Broadly speaking, there are 
two lessons from this type of sensitivity 
analysis. First, the risk free rate puzzle 
can be resolved as long as the link be- 
tween individual attitudes toward risk 
and growth contained in the standard 
preferences (1) is broken. Second, the 
equity premium puzzle is much more ro- 
bust: individuals must either be highly 
averse to their own consumption risk or 
to per capita consumption risk if they are 
to be marginally indifferent between in- 
vesting in stocks or bonds. 

It seems that any resolution to the eq- 
uity premium puzzle in the context of a 
representative agent model will have to 
assume that the agent is highly averse to 
consumption risk. Per capita consump- 
tion is very smooth, and therefore does 
not covary greatly with stock returns. Yet 
people continue to demand a high ex- 
pected return for stocks relative to 
bonds. The only possible conclusion is 
that individuals are extremely averse to 
any marginal variation in consumption 
(either their own or societal). 


IV. Incomplete Markets and Trading Costs 


In this section, I examine the ability of 
models with incomplete markets and 
various sorts of trading frictions to ex- 
plain the asset returns data.!® Unlike the 


16 Throughout my discussion, I ignore the exist- 
ence of fiat money. Pamela Labadie (1989) and 
Alberto Giovannini and Labadie (1991) show that 
motivating a demand for money via a cash-in- 
advance constraint does not significantly alter the 
asset pricing implications of the Mehra-Prescott 
(1985) model. However, a richer model of money 
demand might have more dramatic effects on asset 
prices. 
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previous section, which focused solely on 
the decision problem of a “representa- 
tive” agent, the analysis will (necessarily) 
be fully general equilibrium in nature. 
Because of the complexity of these dy- 
namic general equilibrium models, their 
implications are generated numerically; I 
find that this makes any intuition more 
speculative in nature. 

Throughout this section, all investors 
are assumed to have identical prefer- 
ences of the form (1). The “game” in this 
literature is to explain the existence of 
the puzzles solely through the presence 
of incompleteness and trading costs that 
Mehra and Prescott assume away (just as 
in Section III, the “game” was to explain 
the puzzle without allowing such fric- 
tions). 


1. Incomplete Markets 


Underlying the Mehra-Prescott model 
is the presumption that the behavior of 
per capita consumption growth is a good 
guide to the behavior of individual con- 
sumption growth. We have seen that this 
belief is warranted if asset markets are 
complete so that individuals can write 
contracts against any contingency: indi- 
viduals will use the financial markets to 
diversify away any idiosyncratic differ- 
ences in their consumption streams. As a 
result, their consumption streams will 
look similar to each other and to per cap- 
ita consumption. 

However, in reality, it does not appear 
easy for individuals to directly insure 
themselves against all possible fluctua- 
tions in their consumption streams. (For 
example, it is hard to directly insure one- 
self against fluctuations in labor income.) 
For this reason, most economists believe 
that insurance markets are incomplete. 
Intuitively, in the absence of these kinds 
of markets, individual consumption 
growth will feature risk not present in 
per capita consumption growth and so 
individual consumption growth will be 


more variable than per capita consump- 
tion growth. As a result, models with in- 
complete markets offer the hope that 
while the covariance of per capita con- 
sumption growth with stock returns is 
small, individual consumption growth 
may covary enough with stock returns to 
explain the equity premium. 

Dynamic Self-insurance and the Risk 
Free Rate. The intuitive appeal of in- 
complete markets for explaining asset re- 
turns data is supported by the work of 
Weil (1992). He studies a two period 
model in which financial markets are in- 
complete. He shows that if individual 
preferences exhibit prudence (that is, 
convex marginal utility—see Miles Kim- 
ball 1990), the extra variability in indi- 
vidual consumption growth induced by 
the absence of markets helps resolve the 
risk free rate puzzle. Without complete 
markets, individuals must save more in 
order to self-insure against the random- 
ness in their consumption streams; the 
extra demand for savings drives down the 
risk free rate. Weil also shows that if in- 
dividuals exhibit not just prudence but 
decreasing absolute prudence (see Kim- 
ball 1990), the additional variability in 
consumption growth induced by market 
incompleteness also helps to explain the 
equity premium puzzle. The extra riski- 
ness in individual consumption makes 
stocks seem less attractive to an individ- 
ual investor than they would to a “repre- 
sentative” consumer.!* 

Unfortunately, two period models 
cannot tell the full story because they 
abstract from the use of dynamic trad- 
ing as a form of insurance against risk. 
For example, suppose my salary next 
year is equally likely to be $40,000 or 


17 Mankiw (1986) shows that if the conditional 
variability of individual income shocks is higher 
when the aggregate state of the economy is lower, 
then prudence alone is sufficient to generate a 
higher equity premium in the incomplete markets 
environment. 
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$50,000. If I die at the end of the year 
and do not care about my descendants, 
then the variability in income has to be 
fully reflected in my expenditure pat- 
terns. This is the behavior that is cap- 
tured by two period models. On the 
other hand, if I know I will live for many 
more years, then I need not absorb my 
income risk fully into current consump- 
tion—I can partially offset it by reducing 
my savings when my income is high and 
increasing my savings when my income is 
low. 

To understand the quantitative impli- 
cations of dynamic self-insurance for the 
risk free rate, it is best to consider a sim- 
ple model of asset trade in which there 
are a large number (in fact, a continuum) 
of ex ante identical infinitely lived con- 
sumers. Each of the consumers has a 
random labor income and their labor in- 
comes are independent of one another 
(so there is no variability in per capita 
income). The consumers cannot directly 
write insurance contracts against the 
variability in their individual income 
streams. In this sense, financial markets 
are incomplete. However, the individuals 
are able to make risk free loans to one 
another, although they cannot borrow 
more than B in any period. 

For now, I want to separate the ef- 
fects of binding borrowing constraints 
from the effects of incomplete markets. 
To eliminate the possibility that the 
debt ceiling B is ever a binding con- 
straint, I assume throughout the follow- 
ing discussion that B is larger than 
Ymin/r, Where yin is the lowest possible 
realization of income. No individual will 
borrow more than ymin/r; doing so would 
mean that with a sufficiently long run of 
bad income realizations, the individual 
would be forced into default, which is as- 
sumed to be infinitely costly (see Aiya- 
gari 1994). Market clearing dictates that 
for every lender, there must be a bor- 
rower; hence, ymin must be larger than 


zero if there is to be trade in equilib- 
rium. 15 

As in the two period context, an indi- 
vidual in this economy faces the possibil- 
ity of an uninsurable consumption fall. 
The desire to guard against this possibil- 
ity (at least on the margin) generates a 
greater demand to transfer resources 
from the present to the future than in a 
complete markets environment. This ex- 
tra demand for saving forces the equilib- 
rium interest rate below the complete 
markets interest rate.19 

However, the ability to dynamically 
self-insure in the infinite horizon setting 
means that the probability of an uninsur- 
able consumption fall is much smaller 
than in the two period context. In equi- 
librium, the “typical” individual has no 
savings (because net asset holdings are 
zero), but does have a “line of credit” 
Ymin/r. The individual can buffer any 
short-lived falls in consumption by bor- 
rowing; his line of credit can be ex- 
hausted only by a relatively unlikely long 
“run” of bad income realizations. 
(Equivalently, in a stationary equilib- 
rium, few individuals are near their 
credit limit at any given point in time.) 


18 Christopher Carroll (1992) argues using data 
from the Panel Study in Income Dynamics that 
ymin equals zero. In this case, no individual can 
ever borrow (because their debt ceiling equals 
zero). There can be trade of risk free bonds only if 
there is an outside supply of them (say, by the 
government). Then, the risk free rate puzzle be- 
comes an issue of how low the amount of outside 
debt must be in order to generate a plausibly low 
value for the risk free rate. There is no clean an- 
swer to this question in the existing literature. 

19 Marilda Sotomayor (1984) proves that the in- 
dividual demand for asset holdings and for con- 
sumption grows without bound over time if the 
interest rate equals the rate of time preference— 
the complete markets interest rate. (Her result is 
valid without the assumption of convex marginal 
utility as long as the consumption set is bounded 
from below.) This implies that the equilibrium in- 
terest rate must fall below the rate of time prefer- 
ence in order to deter individuals from saving so 
much. See Zeldes (1989b) and Angus Deaton 
(1992) for numerical work along these lines in a 
finite horizon setting. 
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Thus, the extra demand for savings gen- 
erated by the absence of insurance mar- 
kets will generally be smaller in the infi- 
nite horizon economy than in a two 
period model. 

This reasoning suggests that in the in- 
finite horizon setting, the absence of in- 
come insurance markets may have little 
impact on the interest rate. The numeri- 
cal work of Huggett (1993) and Heaton 
and D. Lucas (1995b) confirms this in- 
tuition. They examine infinite horizon 
economies which are calibrated to ac- 
cord with aspects of United States data 
on individual labor income and find that 
the difference between the incomplete 
markets interest rate and the complete 
markets interest rate is small.2° 

The above discussion assumes that in- 
dividual income shocks die out eventu- 
ally (that is, the shocks are stationary). 
Constantinides and Darrell Duffie 
(1995) point out that the story is very 


20In fact, they obtain these results in econo- 
mies in which some fraction of the agents face 
binding borrowing constraints in equilibrium. Re- 
laxing these borrowing constraints or adding out- 
side debt will serve to further increase the equilib- 
rium interest rate toward the complete markets 
rate (Aiyagari 1994). 

There is a caveat associated with the results of 
Huggett (1993) and Heaton and D. Lucas 
(1995b)—and with most of the other numerical 
work described later in this paper. For computa- 
tional reasons, these papers assume that the pro- 
cess generating individual income is discrete. This 
discretization may impose an artificially high lower 
bound on income—that is, an artificially high 
value of ymin Thus, Heaton and D. Lucas (forth- 
coming) assume that income cannot fall below 75 
percent of its mean value. Huggett (1993) is more 
conservative in assuming that income cannot fall 
below about ten percent of its mean value. In both 
cases, finer and more accurate discretizations may 
drive y pin lower. 

This technical detail may have important conse- 
quences for the numerical results because no indi- 
vidual can ever borrow more than ymin". As Y min 
falls, there is an increased probability of a fall in 
consumption that cannot be “buffered” by using 
accumulated savings or by borrowing, and the 
downward pressure on the equilibrium interest 
rate rises. In fact, as ymin goes to zero, the equilib- 
rium interest rate must go to zero (if there is no 
outside debt). 


different when shocks to individual labor 
income are permanent instead of transi- 
tory. Under the assumption of perma- 
nence, dynamic self-insurance can play 
no role: income shocks must be fully ab- 
sorbed into consumption. For example, 
if my salary falls by $10,000 perma- 
nently, then the only way to keep my 
consumption smooth over time is reduce 
it by the same $10,000 in every period: I 
cannot just temporarily run down my 
savings account to smooth consumption. 
Because income shocks must be fully ab- 
sorbed into consumption, the results 
from the two period model become 
much more relevant. Indeed, Constan- 
tinides and Duffie (1995) show that the 
absence of labor income insurance mar- 
kets, combined with the permanence of 
labor income shocks, has the potential to 
generate a risk free rate that may be 
much lower than the complete markets 
risk free rate. 

The issue, then, becomes an empirical 
one: how persistent (and variable) are 
otherwise undiversifiable shocks to indi- 
vidual income? This is a difficult matter 
to sort out because of the paucity of time 
series evidence available. However, Hea- 
ton and D. Lucas (1995a) use data from 
the Panel Study of Income Dynamics 
and estimate the autocorrelation of idio- 
syncratic?! labor income shocks to be 
around 0.5. When they examine the pre- 
dictions of the above model using this 
autocorrelation for labor income (or 
even an autocorrelation as high as 0.8), 
they find that the equilibrium interest 


21 What matters is the persistence of the undi- 
versifiable component of labor income. Heaton 
and Lucas (1995a) implicitly assume that individu- 
als can hedge any labor income risk that is corre- 
lated with aggregate consumption. Hence, they 
obtain their autocorrelation estimate of 0.5 only 
after first controlling for aggregate effects upon 
individual labor income (this may explain why 
their estimate of persistence is so much lower than 
that of Glenn Hubbard, onetan Skinner, and 
Zeldes, 1994, who control for aggregate effects 
only through the use of year dummies). 
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rate is close to the complete markets 
rate. 

Dynamic Self-insurance and the Eq- 
uity Premium Puzzle. The above analysis 
shows that as long as most individuals 
are sufficiently far from the debt ceiling 
Ymin/?, dynamic self-insurance implies 
that the equilibrium interest rate in in- 
complete markets models is well-ap- 
proximated by the complete markets in- 
terest rate. In this section, I claim that a 
similar conclusion can be reached about 
the equity premium as long as the inter- 
est rate and the persistence of shocks are 
both sufficiently low. 

The following argument is a crude in- 
tuitive justification for this claim, based 
largely on the Permanent Income Hy- 
pothesis. Consider an infinitely lived in- 
dividual who faces a constant interest 
rate r and who faces income shocks that 
have a positive autocorrelation equal to 
p; if p = 1, the shocks are permanent 
while if p < 1, their effect eventually dies 
out. Financial markets are incomplete in 
the sense that the individual cannot ex- 
plicitly insure against the income shocks. 
Because of concave utility, the individual 
wants to smooth consumption as much as 
possible. 

Suppose the individual’s income is sur- 
prisingly higher in some period by 6 dol- 
lars. Because of the autocorrelation in 
the income shocks, the present value of 
his income rises by: 


8+ pd/(lt+r)+p%/1t+r)?t+.... 
= -p/(1+7r)) 
=8(1l+r/(1+r—p). 
The individual wants to smooth con- 
sumption. Hence, he increases consump- 
tion in every period of life by the annu- 


itized value of the increase in his wealth. 
A simple present value calculation im- 


plies that his consumption rises by 6r/(1 


+r-—p) dollars. 
If the individual were able to fully in- 


sure against all income shocks, then his 
consumption would not change at all in 
response to the surprise movement in his 
income process. Thus, his ability to self- 
insure against income shocks using only 
savings can be measured by how close 
r/(1 +r — p) is to zero. For example, if p 
= l, and all shocks are permanent, then 
r/(1 +r — p) is equal to one; the income 
shock is fully absorbed into consump- 
tion, which is very different from the im- 
plications of the full insurance model. In 
contrast, if p < 1, then as r nears zero, 
the implications of the full insurance 
model and the incomplete markets mod- 
els for movements in individual con- 
sumption become about the same. In the 
United States annual data, we have seen 
that r is around one percent; hence, even 
if p is as high as 0.75, consumption is 
fairly insensitive to surprise movements 
in labor income. (Recall that Heaton and 
D. Lucas, 1995a, estimate the autocorre- 
lation of undiversifiable income shocks 
to be 0.5.) 

This intuitive argument is borne out 
by the numerical work of Christopher 
Telmer (1993) and D. Lucas (1994: see 
also Heaton and D. Lucas 1995a, 1995b; 
Wouter den Haan 1994; and Albert Mar- 
cet and Singleton 1991). These papers 
examine the quantitative predictions of 
dynamic incomplete markets models, 
calibrated using individual income data, 
for the equity premium. They find that 
dynamic self-insurance allows individuals 
to closely approximate the allocations in 
a complete markets environment. The 
equilibrium asset prices are therefore 
similar to the predictions of the Mehra- 
Prescott (1985) model. 

A critical assumption underlying this 
numerical work is that income shocks 
have an autocorrelation less than one. 
Constantinides and Duffie (1995) show 
that if labor income shocks are instead 
permanent, then it is possible for incom- 
plete markets models to explain the large 
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size of the equity premium. As in the 
case of the risk free rate puzzle, the 
failure of dynamic self-insurance when p 
=1 means that the model’s implications 
resemble those of a two period frame- 
work. 

To sum up: the assumption that finan- 
cial markets are complete strikes many 
economists as prima facie ridiculous, and 
it is tempting to conclude that it is re- 
sponsible for the failure of the Mehra- 
Prescott model to explain the United 
States data on average stock and bond 
returns. However, as long as investors 
can costlessly trade any financial asset 
(for example, risk free loans) over time, 
they can use the accumulated stock of 
the asset to self-insure against idiosyn- 
cratic risk (for example, shocks to indi- 
vidual income). The above discussion 
shows that given this ability to self-in- 
sure, the behavior of the risk free rate 
and the equity premium are largely unaf- 
fected by the absence of markets as long 
as idiosyncratic shocks are not highly 
persistent. 


2. Trading Costs 


In the incomplete and complete mar- 
kets models described above, it is as- 
sumed that individuals can costlessly 
trade any amount of the available securi- 
ties. This assumption is unrealistic in 
ways that might matter for the two puz- 
zles. For example, in order to take ad- 
vantage of the possible utility gains asso- 
ciated with the large premium offered by 
equity, individuals have to reduce their 
holdings of Treasury bills and buy more 
stocks. The costs of making these trades 
may wipe out their apparent utility bene- 
fits. In this subsection, I consider the im- 
plications for asset returns of various 
types of trading costs. 

Borrowing and Short Sales Con- 
straints. Consider the familiar two pe- 
riod model of borrowing and lending in 
most undergraduate microeconomic text- 


books. In this model, the individual’s 
budget set of consumption choices (c1, 
Cg) is written as: 


c1syitbi 
c2+bı(l +r)<y2 
c120,c220 


where bı is the level of borrowing in pe- 
riod 1. The budget set allows the individ- 
ual to borrow up to the present value of 
his period two income. 

Many economists believe that this 
model of borrowing and lending ignores 
an important feature of reality: because 
of enforcement and adverse selection 
problems, individuals are generally not 
able to fully capitalize their future labor 
income. One way to capture this view of 
the world is to add a constraint to the 
individual’s decision problem, bı < B, 
where B is the limit on how much the 
individual can borrow. If B is less than 
yo/(1 + r), the constraint on bı may be 
binding because the individual may not 
be able to transfer as much income from 
period two to period one as he desires. 

The restriction bı < B is called a bor- 
rowing constraint. (Short sales con- 
straints are similar restrictions on the 
trade of stocks.) The law of supply and 
demand immediately implies that the 
equilibrium interest rate is generally 
lower if many individuals face a binding 
borrowing constraint than if few indi- 
viduals do. Intuitively, “tighter” borrow- 
ing constraints exogenously reduce the 
size of the borrowing side of the market; 
in order to clear markets, interest rates 
must fall in order to shrink the size of 
the lending side of the market. 

In keeping with this intuition, the 
work of Huggett (1993) and Heaton and 
D. Lucas (1995a, 1995b) demonstrates 
numerically that if a sizeable fraction of 
individuals face borrowing constraints, 
then the risk free rate may be substan- 
tially lower than the predictions of the 
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representative agent model. Using data 
from the Panel Study on Income Dynam- 
ics, Zeldes (1989a) documents that the 
large number of individuals with low lev- 
els of asset holdings appear to face such 
constraints. 

However, Heaton and D. Lucas 
(1995a, 1995b) show that borrowing and 
short sales constraints do not appear to 
have much impact on the size of the 
equity premium. There is a sound intui- 
tion behind these results: an individual 
who is constrained in the stock market 
generally must also be constrained in the 
bond market (or vice versa). Otherwise, 
the individual could loosen the con- 
straint in the bond market by shifting re- 
sources to it from the stock market. 
Hence, just as the average risk free rate 
must fall to clear the bond market when 
many individuals are constrained, so 
must the average stock return fall in or- 
der to clear the constrained stock mar- 
ket.22 

Transaction Costs. Individuals who try 
to engage in asset trade quickly find that 
they face all sorts of transaction costs. 
(These include informational costs, bro- 
kerage fees, load fees, the bid-ask 
spread, and taxes.) To understand how 
transaction costs affect the pricing of se- 
curities, consider an investor in a risk 
free world who can invest in two differ- 
ent securities, stocks and bonds. Stocks 
pay a constant dividend equal to d, and 
have a constant price equal to p. Because 
stocks are a perpetuity, their rate of re- 
turn r, ignoring transaction costs, equals 
d/p. Bonds are short-lived and pay a con- 
stant interest rate rp. There is no cost 
associated with trading bonds. However, 
stocks are costly to trade in that a buyer 


22 Following Hua He and David Modest (1995), 
Cochrane and Hansen (1992) show that this argu- 
ment is exactly correct when individuals face a 
market wealth constraint that restricts the total 
value of their asset portfolio not to fall below some 
exogenously specified (negative) number. 


of stocks must incur a cost pt , in addi- 
tion to the price p, to buy a share. 

In equilibrium, the cost of buying a 
share of stock cannot be smaller than the 
present value of the benefits of owning 
that share for N periods (or there would 
be excess demand for stocks because 
everyone would want to buy shares). 
This statement can be expressed mathe- 
matically as: 

pA +t) 2 d(l + rb) + d/(1 +r)" 

+... +d +r +p/1 + rb). 


This present value expression can be re- 
written as: 


r= d/p <(1+0r,/{1-— (1+ rp}. 


Note that the right hand side gets 
smaller as N grows large. Intuitively, as 
the investor holds the stock for a long 
period of time, he is able to more fully 
amortize the initial trading cost; the indi- 
vidual is therefore only prevented from 
making arbitrage profits if stock returns 
are extremely close to those of bonds. 
Indeed, when N equals infinity, the up- 
per bound on the premium paid by 
stocks is very tight: 


r, = Tp ST. 


Thus, if r, = 1 percent, and investors can 
buy stocks and hold them forever, then 
r, can be as large as seven percent in 
equilibrium only if t is greater than 600 
percent! 

In the above risk free model, an inves- 
tor could buy and hold a stock for an in- 
finite number of periods. However, if 
the investor instead faces income risk 
which cannot be directly insured, she is 
not able to plan on being able to hold 
stocks forever. The reason is simple: if 
she is ever pushed up against a debt ceil- 
ing, so she cannot borrow to fully offset 
bad income shocks, she will have to sell 
the stocks in order to smooth her con- 
sumption. This cap on the investor's 


“holding period” generated by potential 
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runs of bad luck raises the possibility 
that in models with missing income in- 
surance markets, transaction costs could 
have big effects on asset return spreads. 

However, from our discussion of cali- 
brated incomplete markets models, we 
know that an infinitely lived individual 
can do a pretty good job of smoothing 
consumption by simply buying and sell- 
ing the asset that is cheaper to trade. 
Hence, the agent faces only a low prob- 
ability of needing to sell stocks in order 
to smooth consumption. She can conse- 
quently contemplate making arbitrage 
profits by buying and holding stocks for 
a long period of time; as we have seen, 
this means that stocks cannot pay a 
much higher return than bonds in equi- 
librium. 

Thus, as Aiyagari and Mark Gertler 
(1991) and Heaton and D. Lucas (1995a) 
find, the only way to explain the equity 
premium using transaction costs is to as- 
sert that there are significant differences 
in trading costs across the stock and 
bond markets. In my view, there is little 
evidence to support this proposition at 
present, although both Aiyagari and 
Gertler (1991) and Heaton and D. Lucas 
(1995b) make some attempts in this di- 
rection. To be considered as the leading 
explanation of the puzzle, more needs to 
be done to document the sizes and 
sources of trading costs (especially for 
pension funds and other institutional in- 
vestors that operate on behalf of stock- 
holders).28 

Market Segmentation. A key aspect of 
the Mehra-Prescott (1985) model is that 


23In their partial equilibrium settings, Erzo 
Luttmer (1995) and He and Modest (1995) assume 
that all investors fully turn over their port- folios 
relatively frequently (once a month or once a 
quarter) and all at the same time. This assumption 
allows them to explain the equity premium with 
relatively small transaction costs; however, it 
seems difficult to build a general equilibrium 
model in which investors would choose to incur 
trading costs so frequently and so simultaneously. 


it treats per capita consumption growth 
as a good proxy for individual consump- 
tion growth. Yet, Mankiw and Zeldes 
(1991) and Michael Haliassos and Carol 
Bertaut (forthcoming) document that 
only about 30 percent of individuals in 
the United States own stocks (either di- 
rectly or through defined contribution 
pension funds). This suggests the possi- 
bility of market segmentation: that for 
whatever reason, only a subset of inves- 
tors are actively involved in asset trade. 
Campbell (1993) points out that it is the 
per capita consumption growth of the ac- 
tive traders that should be substituted 
into first order conditions like (2a) and 
(2b), not overall per capita consumption 
growth as Mehra and Prescott (1985) 
use. 

However, Mankiw and Zeldes (1991) 
provide direct evidence that market seg- 
mentation alone cannot explain the eq- 
uity premium. They find that the con- 
sumption growth of stockholders does 
covary considerably more with stock re- 
turns than the consumption growth of 
nonstockholders. However, the covari- 
ance is still not large enough: it is only if 
they are highly risk averse that stock- 
holders are marginally indifferent be- 
tween stocks and bonds. There is still an 
equity premium puzzle when we look at 
the consumption of those involved in as- 
set trade.?4 


24 Ingram (1990) examines the general equilib- 
rium implications of a model in which half the 
agents use all possible information to form their 
portfolios, while the other half hold an optimal 
“nonchurning” portfolio: they are constrained not 
to change the split of assets between stocks and 
bonds in response to new information. She finds 
that the equity premium is no higher than if all 
agents were engaged in asset trade, but the risk 
free rate is significantly lower. Presumably, the 
“rule of thumb” traders hold too many bonds in 
their portfolio because they want to be sure that 
they are insured against “runs” of bad income re- 
alizations. This drives the equilibrium interest rate 
downwards, but has little effect on the equity pre- 
mium. 
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3. Summary 


The Mehra-Prescott (1985) model as- 
sumes that asset trade is costless and 
that asset markets are complete. Casual 
observation suggests that both assump- 
tions are unduly strong. Unfortunately, 
the predictions of the model for the eq- 
uity premium and the risk free rate are 
not greatly affected by allowing markets 
to be incomplete: dynamic self-insurance 
allows individuals to smooth idiosyn- 
cratic shocks without using explicit insur- 
ance contracts. It is true (by the law of 
supply and demand) that a frequently 
binding borrowing constraint forces the 
average risk free rate down toward its 
sample value. The only way to simultane- 
ously generate a sizeable equity pre- 
mium, though, is to make trading in the 
stock market substantially more costly 
than in the bond market. Under this as- 
sumption, the premium on stocks repre- 
sents compensation not for risk (as in the 
standard financial theory), but rather for 
bearing additional transactions costs. 
The sources of these additional costs re- 
main unclear. 

Despite its general lack of success in 
explaining the equity premium, this lit- 
erature has made an important contribu- 
tion to our understanding of trade in 
markets with frictions. Just as people in 
real life are able to find ways around 
government regulation of the economy, 
rational individuals in models are able to 
find ways around barriers to trade. It 
takes an enormous amount of sand in the 
gears to disrupt the ability of individuals 
to approximate an efficient allocation of 
resources. 

There is, however, a continuing weak- 
ness in the “market frictions” literature. 
Usually, the incompleteness of markets 
and the costliness of trade in these mod- 
els is motivated by adverse selection or 
moral hazard considerations. Yet, this 
motivation remains informal. From a 


theoretical point of view (and possibly an 
empirical one as well), the models will 
be stronger if they explicitly take into ac- 
count the informational problems that 
lead to the trading frictions. 


V. Discussion and Conclusions 


This paper began by posing two puz- 
zles: how can we explain why average 
stock returns are so much higher than 
bond returns in the United States data, 
and why has per capita consumption 
grown so quickly given that bond returns 
are so low? As it turns out, there are a 
variety of ways to generate more savings 
demand than in the Mehra-Prescott 
model and thereby explain the latter 
phenomenon. 

However, the equity premium puzzle 
is much more challenging. Throughout 
this article, we have seen only two ways 
to explain the wedge in average returns 
between stocks and bonds. The first is 
that there is a large differential in the 
cost of trading between the stock and 
bond markets. To make this explanation 
compelling, it is important to ascertain 
the size of actual trading costs in these 
markets, and to provide an explanation 
of why those costs exist. Heaton and D. 
Lucas (1995b) and Aiyagari and Gertler 
(1991) discuss some early steps in this 
direction. 

The second explanation is that, con- 
trary to Mehra and Prescott’s (1985) 
original parametric restrictions, individ- 
ual investors have coefficients of relative 
risk aversion larger than ten (either with 
respect to their own consumption or 
with respect to per capita consump- 
tion).2° As I explained earlier, the prob- 
lem with this explanation is that only a 


25 Of course, as mentioned above, to resolve the 
risk free rate puzzle, we cannot only adopt a more 
flexible notion of what constitutes reasonable risk 
aversion; we must abandon the standard prefer- 
ences (1) or impose borrowing constraints that 
bind frequently. 
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handful of economists believe that indi- 
viduals are that risk averse. One way to 
support the “high risk aversion” view is 
to demonstrate that this apparently 
“strange” assumption about human be- 
havior is consistent with data other than 
the average realization of the equity pre- 
mium. Until now, little has been done 
along these lines, but Tallarini’s (1994) 
analysis of a “prototypical” real business 
cycle model using generalized expected 
utility preferences represents a promis- 
ing first step. 

Ten years ago, Mehra and Prescott 
(1985) wrote in their conclusion that the 
relatively low rate of return of Treasury 
bills, “is not the only example of some 
asset receiving a lower return than that 
implied by Arrow-Debreu general equi- 
librium theory. Currency, for example, is 
dominated by Treasury bills with positive 
nominal yields yet sizable amounts of 
currency are held.” Thus, they regard 
the equity premium puzzle as being 
analogous to the so-called “rate of return 
dominance” puzzle that motivates much 
of modern monetary theory. 

I believe that the past decade of work 
has made this analogy even more persua- 
sive: I find that the suggested “solutions” 
to the equity premium puzzle closely re- 
semble the approaches used by econo- 
mists to generate a demand for money in 
the face of rate of return dominance. 
Thus, the “high risk aversion” story ex- 
plains the puzzle by some peculiar aspect 
of individual preferences—which is ex- 
actly the way money-in-the-utility-func- 
tion models explain money demand. The 
“transactions costs” story explains that 
bonds are held despite their low rate of 
return because they are less costly to 
trade—which is exactly the way transac- 
tions costs models explain money de- 
mand. 

Generally, both of these types of mod- 
els of money demand are regarded as ad 
hoc “reduced forms.” They fail to cap- 


ture the essential technological forces 
(for example, lack of communication) 
that disrupt the process of fully central- 
ized exchange and thereby generate a 
demand for money. This same complaint 
of superficiality can also be made of the 
two types of stories explaining the equity 
premium. 

In one sense, the accuracy of the 
money demand analogy is depressing: 
monetary theorists are a long way from 
delivering a definitive model of money 
demand. On the other hand, the work of 
Robert Townsend (1987) and others in 
monetary theory is exciting because it 
shows so clearly what it will take to make 
true progress in explaining the size of 
the equity premium. Like fiat money, 
the equity premium appears to be a 
widespread and persistent phenomenon 
of market economies. The universality of 
the equity premium tells us that, like 
money, the equity premium must 
emerge from some primitive and ele- 
mentary features of asset exchange that 
are probably best captured through ex- 
tremely stark models. With this in mind, 
we cannot hope to find a resolution to 
the equity premium puzzle by continuing 
in our current mode of patching the 
standard models of asset exchange with 
transactions costs here and risk aversion 
there. Instead, we must seek to identify 
what fundamental features of goods and 
asset markets lead to large risk adjusted 
price differences between stocks and 
bonds. While I have no idea what these 
“fundamental features” are, it is my be- 
lief that any true resolution to the equity 
premium puzzle lies in finding them. 


Appendix 


Apart from two exceptions, the follow- 
ing italicized description is taken directly 
from Mehra and Prescott (1985, pp. 
147-48). 

The data used in this study consists of 
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five basic series for the period 1889- 
1978. The first four are identical to those 
used by Grossman and Shiller (1981) in 
their study. The series are individually 
described below. 

(i) Series P: Annual average Standard 
& Poor’s Composite Stock Price Index di- 
vided by the Consumption Deflator, a 
plot of which appears in Grossman and 
Shiller (1981, p. 225, fig. 1). 

(ii) Series D: Real annual dividends for 
the Standard & Poor’s series. 

(iii) Series C: Kuznets-Kendrik-USNIA 
per capita real consumption on non-dur- 
ables and services. 

(iv) Series PC: Consumption deflator 
series, obtained by dividing real con- 
sumption in 1972 dollars on non-dur- 
ables and services by the nominal con- 
sumption on non-durables and services. 

(v) Series RF: Nominal yield on rela- 
tively riskless short-term securities over 
the 1889-1978 period; the securities used 
were ninety-day government Treasury 
bills in the 1931-1978 period, Treasury 
Certificates for the 1920-1930 period 
and sixty-day to ninety-day Prime Com- 
mercial Paper prior to 1920 [the data 
was obtained from Homer (1963) and 
Ibbotson and Sinquefeld (1978)]. 

These series were used to generate the 
series actually utilized in this paper. Se- 
ries P and D above were used to deter- 
mine the average annual real return on 
the Standard & Poor’s 500 Composite In- 
dex over the ninety-year period of study. 
The annual return for year t was com- 
puted as (Paa + Duy — PY/P,. The returns 
are plotted in fig. 2. Series C was used 
to determine the process on the growth 
rate of consumption over the same period 

. . A plot of the percentage growth rate 
of real consumption appear in fig. 1. To 
determine the real return on a relatively 
riskless security we used the series RF 
and PC. For year t, this is calculated 
to be (1 + RF)PC/PCy4). This series is 
plotted in fig. 3. 


This description differs from that of 
Mehra and Prescott (1985) in two ways. 
First, the figure numbers have been 
changed to match the numbering in this 
paper. Second, Mehra and Prescott 
(1985) used the formula RF, — (PCi41 — 
PC,)/PC, for the real return to the rela- 
tively riskless security. This formula ex- 
presses the real rate as being the differ- 
ence between the nominal rate and the 
inflation rate. Of course, their formula is 
only an approximation; I use the correct 
ratio formula instead. 

This difference in the formulae creates 
only small differences in results. For ex- 
ample, they find that the average rela- 
tively riskless rate is 0.8 percent. I find it 
instead to be one percent. They find that 
the variance of the relatively riskless rate 
is 0.0032; I find it to be 0.00308. 
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